The shortest common supersequence problem in a microarray production setting

نویسنده

  • Sven Rahmann
چکیده

MOTIVATION During microarray production, several thousands of oligonucleotides (short DNA sequences) are synthesized in parallel, one nucleotide at a time. We are interested in finding the shortest possible nucleotide deposition sequence to synthesize all oligos in order to reduce production time and increase oligo quality. Thus we study the shortest common super-sequence problem of several thousand short strings over a four-letter alphabet. RESULTS We present a statistical analysis of the basic ALPHABET-LEFTMOST approximation algorithm, and propose several practical heuristics to reduce the length of the super-sequence. Our results show that it is hard to beat ALPHABET-LEFTMOST in the microarray production setting by more than 2 characters, but these savings can improve overall oligo quality by more than four percent. AVAILABILITY Source code in C may be obtained by contacting the author, or from http://oligos.molgen.mpg.de.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Problems Related to Subsequences and Supersequences

We present an algorithm for building the automaton that searches for all non-overlapping occurrences of each subsequence from the set of subsequences. Further, we define Directed Acyclic Supersequence Graph and use it to solve the generalized Shortest Common Supersequence problem, the Longest Common Non-Supersequence problem, and the Longest Consistent Supersequence problem.

متن کامل

Solving Shortest Common Supersequence Problem Using Artificial Bee Colony Algorithm

The idea behind this work is to solve the Shortest Common Supersequence by using Artificial Bee Colony (ABC) Algorithm. This algorithm is considered as one of the newest nature-inspired swarm-based optimization algorithms and has a promising performance [6]. Shortest Common Supersequence is a classical problem in the field of strings and it is classified as NP-Hard problem [12]. Many algorithms...

متن کامل

Shop Cells

In this paper the Group Technology problem of grouping parts in families and machines in cells in order to convert a job shop production system to some pseudo ow shops is considered. Several heuristic methods for solving Group Technology problem have been proposed in the literature, in general, not producing a ow shop in each cell|unless a great number of extra machines is added. We use known t...

متن کامل

R O M a Group Technology with Flow Shop Cells

In this paper the Group Technology problem of grouping parts in families and machines in cells in order to convert a job shop production system to some pseudo ow shops is considered Several heuristic methods for solving Group Technology problem have been proposed in the literature in general not producing a ow shop in each cell unless a great number of extra machines is added We use known techn...

متن کامل

Minimum cost multi-product flow lines

In this paper, the problem of finding the minimum cost flow line able to produce different products is considered. This problem can be formulated as a shortest path problem on an acyclic di-graph when the machines graph associated with each product family is a chain or a comb. These graphs are relevant in production planning when dealing with pipelined assembly systems. We solve the problem usi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Bioinformatics

دوره 19 Suppl 2  شماره 

صفحات  -

تاریخ انتشار 2003